GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

نویسندگان

  • Shuchang Zhou
  • Taihong Xiao
  • Yi Yang
  • Dieqiao Feng
  • Qinyao He
  • Weiran He
چکیده

Object Transfiguration replaces an object in an image with another object from a second image. For example it can perform tasks like “putting exactly those eyeglasses from image A on the nose of the person in image B”. Usage of exemplar images allows more precise specification of desired modifications and improves the diversity of conditional image generation. However, previous methods that rely on feature space operations, require paired data and/or appearance models for training or disentangling objects from background. In this work, we propose a model that can learn object transfiguration from two unpaired sets of images: one set containing images that “have” that kind of object, and the other set being the opposite, with the mild constraint that the objects be located approximately at the same place. For example, the training data can be one set of reference face images that have eyeglasses, and another set of images that have not, both of which spatially aligned by face landmarks. Despite the weak 0/1 labels, our model can learn an “eyeglasses” subspace that contain multiple representatives of different types of glasses. Consequently, we can perform fine-grained control of generated images, like swapping the glasses in two images by swapping the projected components in the “eyeglasses” subspace, to create novel images of people wearing eyeglasses. Overall, our deterministic generative model learns disentangled attribute subspaces from weakly labeled data by adversarial training. Experiments on CelebA and Multi-PIE datasets validate the effectiveness of the proposed model on real world data, in generating images with specified eyeglasses, smiling, hair styles, and lighting conditions etc. The code is available online. c © 2017. The copyright of this document resides with its authors. It may be distributed unchanged freely in print or electronic forms. ar X iv :1 70 5. 04 93 2v 1 [ cs .C V ] 1 4 M ay 2 01 7 2 STUDENT, PROF, COLLABORATOR: BMVC AUTHOR GUIDELINES (a) Object Removal (b) Object Transplanting Figure 1: (a) Encoder of GeneGAN decomposes an image to the background feature A and the object feature u. The decoder can reconstruct an image without the object (a nonsmiling face), from background feature A and the zero object feature (denoted as 0). (b) Decomposed object feature can be used to transplant the object to another image. When the “smiling” feature u, which is from the first image Au, and the background feature B are fed to a decoder, the generated image Bu would ideally have the same level and style of smiling as Au.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Attention-GAN for Object Transfiguration in Wild Images

This paper studies the object transfiguration problem in wild images. The generative network in classical GANs for object transfiguration often undertakes a dual responsibility: to detect the objects of interests and to convert the object from source domain to target domain. In contrast, we decompose the generative network into two separat networks, each of which is only dedicated to one partic...

متن کامل

Image Classification via Sparse Representation and Subspace Alignment

Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...

متن کامل

Image alignment via kernelized feature learning

Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...

متن کامل

A New Guideline for the Allocation of Multipoles in the Multiple Multipole Method for Two Dimensional Scattering from Dielectrics

A new guideline for proper allocation of multipoles in the multiple multipole method (MMP) is proposed. In an ‘a posteriori’ approach, subspace fitting (SSF) is used to find the best location of multipole expansions for the two dimensional dielectric scattering problem. It is shown that the best location of multipole expansions (regarding their global approximating power) coincides with the med...

متن کامل

Clustering based on Dirichlet mixtures of attribute ensembles

We propose a model-based approach to identifying clusters of objects based on subsets of attributes, so that the attributes that distinguish a cluster from the rest of the population, called an attribute ensemble, may depend on the cluster being considered. The model is based on a Pólya urn cluster model, which is equivalent to a Dirichlet process mixture of multivariate normal distributions. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1705.04932  شماره 

صفحات  -

تاریخ انتشار 2017